CDS

Accession Number TCMCG075C26919
gbkey CDS
Protein Id XP_007014413.2
Location complement(join(15826335..15826721,15827761..15827870,15828752..15828827,15828927..15829112,15829755..15829832,15829918..15829998,15830104..15830169,15830270..15830335,15830456..15830506,15831012..15831104,15831205..15831332,15831503..15831641,15831766..15831863,15831997..15832066,15832157..15832246))
Gene LOC18589395
GeneID 18589395
Organism Theobroma cacao
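
The Location field above is in GenBank feature-location syntax: `join(...)` lists the exon segments as 1-based inclusive coordinates, and `complement(...)` places the feature on the reverse strand. As a minimal sketch, the segments can be pulled out with a regular expression and their lengths summed to check the record against its stated protein length. The helper names `parse_location` and `spliced_length` are hypothetical, introduced only for illustration. The parser handles only the simple `start..end` form used in this record, not the full location grammar.

```python
import re

# Location string copied from the record above.
LOCATION = (
    "complement(join(15826335..15826721,15827761..15827870,"
    "15828752..15828827,15828927..15829112,15829755..15829832,"
    "15829918..15829998,15830104..15830169,15830270..15830335,"
    "15830456..15830506,15831012..15831104,15831205..15831332,"
    "15831503..15831641,15831766..15831863,15831997..15832066,"
    "15832157..15832246))"
)


def parse_location(location):
    """Return (strand, segments) for a simple join/complement location.

    Hypothetical helper: only plain 'start..end' segments are supported.
    """
    strand = "-" if location.startswith("complement(") else "+"
    segments = [
        (int(start), int(end))
        for start, end in re.findall(r"(\d+)\.\.(\d+)", location)
    ]
    return strand, segments


def spliced_length(segments):
    """Sum the segment lengths; coordinates are 1-based and inclusive."""
    return sum(end - start + 1 for start, end in segments)


strand, segments = parse_location(LOCATION)
cds_length = spliced_length(segments)

print(strand, len(segments), cds_length)
# The spliced CDS should hold one codon per residue plus a stop codon.
print(cds_length == 3 * (572 + 1))
```

For this record the 15 segments sum to 1719 nt, which matches 572 codons plus one stop codon.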

Protein

Length 572 aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA341501
db_source XM_007014351.2
Definition PREDICTED: putative clathrin assembly protein At5g57200 [Theobroma cacao]

EGGNOG-MAPPER Annotation

COG_category TU
Description Clathrin assembly protein
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000, ko04131
KEGG_ko ko:K20043, ko:K20044
EC -
KEGG_Pathway -
GOs -

Sequence

CDS:  
ATGGGAACATTTAAGAGCTTTAGAAAGGCTTATGGGGCTCTCAAGGACTCAACCAAGGTTGGCCTTGCAAAGGTCAATAGTGAATTCAAGGATTTAGACATTGCCATTGTAAAAGCCACCAACCATGTTGAATGTCCTCCAAAAGAACGCCATGTTCGAAAAATATTTTCTGCGACATCAGTTGTTCGTCCACGAGCAGATGTGGCATATTGCATTCATGCTTTAGCAAAGAGATTATCCAAGACCCGAAGTTGGATTGTGGCCATAAAGATTCTAATAGTTATTCACAGAACATTAAGAGAAGGTGATCCTACATTTAGAGAAGAGCTTCTAAACTACTCACACAGAGGACATATTCTCCAAATATCAAATTTTAAAGATGATTCAAGCCCTCTTGCATGGGACTGCTCTGCATGGGTTAGGACATATGCACTGTTTTTAGAGGAACGACTTGAATGCTTTAGAGTGTTAAAATACGATATTGAAGCAGAACGCTTGACAAAGTCATCGCCCGGGACAAGCAAGGCACATAGTAGAACAAGGCTTTTGGCTAGTAATGAATTATTGGATCAATTGCCTGCATTACAACAACTTCTTTATCGTCTTGTCGGTTGTGAGCCTGAAGGAGCAGCTTACAGCAATTATCTTGTCCAATACGCCTTAGCTTTGGTATTAAAAGAGAGTTTCAAGATCTATTGTGCTATCAATGATGGAATTATCAATCTTGTGGATATGTTCTTTGATATGTCAAGACATGATGCAGTCAAAGCTCTTAATATGTACAAAAGAGCTGGTCAACAGGCTGAAAATCTTGCTGAATTTTATGAATATTGCAAAGGATTGGATCTTGCTAGGAACTTTCTGTTTCCAACATTAAGACAGCCACCACCATCATTTCTTGCAACAATGGAAGAATATATCAAAGAAGCTCCACAAACAGGCTCTGTCCAAAATAGACTGGAATATGAAGAGAGAGAGCAATCACCTTCAGCACCAGATGAACCTGTAAAAACAGAAAAGCAAGAAGATAAAGTTGAGGAGCCTGAGCCTAAATCATTAATTGATCAAGAAGAGGAACCACAACCTAGGGAGGAACTGGAGGAACCTCAACCACTTATATCGACTGAAAATACAGGAGATTTGTTGGGTCTAAATGAAATAAATCCAAGGGCTTTAGAACTAGAGGAAAGCAATGCATTGGCTCTTGCAATAGTCCCACCAGGCACTGATTCAAGAAATCATGGTATAAGTGAAATTGGTGGTACGGGATGGGAGCTGGCACTTGTTACTACACCAAGCAGCCATACAGCTCCTGTGGTAGAAAGCAAATTGGCGGGTGGATTCGACAAGTTATTACTTGATAGCTTGTATGAAGATGAAGCTGCTAGGAGACAGATTCAATTGACTAATGCAGGATATGGATATGGATATGGATATGAAGGGATGGCTGTGCCAAACCCATTCCAGCAGCAGCATGATCCATTTATGTTGTCTAACAACATTGCTCCCCCAACCAATGTACAAATGGCTTTATTACAAGAGCAAATGATGGTACAACAACAACAGCAGCAAATGATGATGGTACCTTACCAATACCAATCTCACTATCCTCAACAGCCTCAATATCTTCAACAGCAGATCCAAAATCCATTTGGAGACCCATTTTTTAACCTCCCACCAGCTTCAACATCACAACAAGGAAATCATGCACTACTTTAA
Protein:  
MGTFKSFRKAYGALKDSTKVGLAKVNSEFKDLDIAIVKATNHVECPPKERHVRKIFSATSVVRPRADVAYCIHALAKRLSKTRSWIVAIKILIVIHRTLREGDPTFREELLNYSHRGHILQISNFKDDSSPLAWDCSAWVRTYALFLEERLECFRVLKYDIEAERLTKSSPGTSKAHSRTRLLASNELLDQLPALQQLLYRLVGCEPEGAAYSNYLVQYALALVLKESFKIYCAINDGIINLVDMFFDMSRHDAVKALNMYKRAGQQAENLAEFYEYCKGLDLARNFLFPTLRQPPPSFLATMEEYIKEAPQTGSVQNRLEYEEREQSPSAPDEPVKTEKQEDKVEEPEPKSLIDQEEEPQPREELEEPQPLISTENTGDLLGLNEINPRALELEESNALALAIVPPGTDSRNHGISEIGGTGWELALVTTPSSHTAPVVESKLAGGFDKLLLDSLYEDEAARRQIQLTNAGYGYGYGYEGMAVPNPFQQQHDPFMLSNNIAPPTNVQMALLQEQMMVQQQQQQMMMVPYQYQSHYPQQPQYLQQQIQNPFGDPFFNLPPASTSQQGNHALL
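
The CDS and Protein entries above should correspond under the standard genetic code. As a minimal sketch of that check, assuming the standard table applies to this nuclear gene, the code below translates only the first and last few codons of the CDS. The function name `translate` and the short test fragments are illustrative choices. The full sequences can be passed in the same way.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First six codons and last five codons of the CDS listed above.
cds_start = "ATGGGAACATTTAAGAGC"
cds_end = "CATGCACTACTTTAA"

print(translate(cds_start))  # MGTFKS
print(translate(cds_end))    # HALL
```

Both fragments agree with the start (`MGTFKS...`) and end (`...HALL`) of the listed protein.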